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We combine searches by the CDF and DO Collaborations for the associated production of a Higgs 
boson with aWoiZ boson and subsequent decay of the Higgs boson to a bottom-antibottom quark 
pair. The data, originating from Fermilab Tevatron pp collisions at yfs = 1.96 TeV, correspond to 
integrated luminosities of up to 9.7 fb _1 . The searches are conducted for a Higgs boson with mass 
in the range 100-150 GeV/c 2 . We observe an excess of events in the data compared with the 
background predictions, which is most significant in the mass range between 120 and 135 GeV/c 2 . 
The largest local significance is 3.3 standard deviations, corresponding to a global significance of 
3.1 standard deviations. We interpret this as evidence for the presence of a new particle consistent 
with the standard model Higgs boson, which is produced in association with a weak vector boson 
and decays to a bottom-antibottom quark pair. 
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The standard model (SM) Higgs boson H is pre- 
dicted to be produced in association with aWorZ boson 
at the Fermilab Tevatron pp Collider if it is within kine- 
matic reach, and its dominant decay mode is predicted to 
be into a bottom-antibottom quark pair (bb), if its mass 
m,H is less than 135 GeV/c 2 HQ. An observation of this 
process would support the SM prediction that the mech- 
anism for electroweak symmetry breaking, which gives 
mass to the weak vector bosons, is also the source of 
fermionic mass in the quark sector. The leptonic decays 
of the W and Z vector bosons and the decays of the H to 
bb provide distinctive signatures of Higgs boson produc- 
tion, which are used to discriminate signal events from 
the copious backgrounds In this Letter, we combine 
the searches from the CDF and DO Collaborations for 
H bosons produced in association with a vector boson, 
with subsequent decays H bb. Both collaborations 
consider the processes WH — > £vbb, ZH —> £ + £~bb, and 
WH, ZH -)• $ T bb 0-EH (where £ is either eor^i and $ T 
denotes missing transverse energy (l2|). and separately 
combine results within their collaborations [13J, [l4( . This 
is the first publication of a combination of CDF and DO's 
searches for H — > bb, which is based on the preliminary 
findings reported in Ref. fislj . 
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Much is known about the Higgs boson from other ex- 
periments. The direct searches at LEP2 in the e + e~ — > 
ZH(-^r bb) mode, with a small contribution from vector 
boson fusion, are very similar to those combined here, 
and exclude SM Higgs boson masses below 114.4 GeV/c 2 
at the 95% confidence level (C.L.) 0. Direct searches 
for VH -> Vbb at the LHC, where V = W or Z [U EH , 
do not yet constrain the allowed SM Higgs boson mass 
range. Including other search modes, direct searches 
at the LHC for the SM Higgs boson limit its mass to 
be between 116.6 and 119.4 GeV/c 2 or between 122.1 
and 127.0 GeV/c 2 , at the 95% C.L. [H, Within 
these searches, both LHC experiments observe local ex- 
cesses above the background expectations for a Higgs 
boson mass of approximately 125 GeV/c 2 . With ad- 
ditional data and analysis improvements, the LHC ex- 
periments confirm these excesses and observe a particle 
with properties consistent with those of the SM Higgs 



boson [2l|. Much of the power of the LHC searches 



comes from gg —> H production and Higgs boson de- 
cays to 77, W + W~ , and ZZ, which probe the couplings 
of the Higgs boson to other bosons. In the allowed mass 
range, the Tevatron experiments are particularly sensi- 
tive to VH production with H — > bb, which probes the 
Higgs boson's coupling to b quarks. We search for Higgs 
bosons of masses 100 < mjj < 150 GeV/c 2 and inter- 
pret our results independently of searches which are not 
sensitive to the specific Higgs boson production and de- 
cay modes studied here. We also report results assuming 
m H = 125 GeV/c 2 . 

Higgs boson signal events are simulated using the 
leading order (LO) calculation from PYTHIA 22j, with 
CTEQ5L (CDF) and CTEQ6L1 (DO) [H parton distri- 
bution functions (PDFs). We normalize our Higgs boson 
signal-rate predictions to the highest-order calculations 
available. The WH and ZH cross section calculations are 
performed at next-to-next-to leading order (NNLO) pre- 
cision in QCD and ncxt-to-leading-order (NLO) precision 
in the electroweak corrections Q . We use the branching 
fractions for Higgs boson decay from Ref. Q • These rely 
on calculations using hdecay [24| and PROPHECy4f [25| ■ 
Assuming the m# = 125 GeV/c 2 hypothesis, we expect 
approximately 155 Higgs boson signal events to pass our 
selection requirements, along with 9.2 x 10 4 background 
events from all other SM sources. 

We model SM and instrumental background processes 
using a mixture of Monte Carlo (MC) and data-driven 
methods. For CDF, backgrounds from SM processes 
with electroweak gauge bosons or top quarks are mod- 
eled using PYTHIA, ALPGEN df, MC@NLO [111, 
and HERWIG [28]. For DO, these backgrounds are mod- 
eled using PYTHIA, ALPGEN, and COMPHEP 
An interface to PYTHIA provides parton showering and 
hadronization for generators without this functionality. 

Diboson ( WW, WZ, ZZ) MC samples are normalized 
using the NLO calculations from mcfm [33] . For tt, we 
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use a production cross section of 7.04±0.70 pb 13 if. -which 
is based on a top-quark mass of 173 GeV/c 2 [32j and 



MSTW 2008 NNLO PDFs [33j. The single-top-quark 



production cross section is taken to be 3.15±0.31 pb [34 [. 
Data-driven methods are used to normalize the W/Z plus 
light-flavor and heavy-flavor jet backgrounds 1351 using 
W/Z data events containing no 6-tagged jets [36|, which 
have negligible signal content 13|, |l4| . 

The CDF and DO detectors are multipurpose solcnoidal 
spectrometers surrounded by hermetic calorimeters and 
muon detectors and are designed to study t he p rod- 
ucts of 1.96 TeV proton-antiproton collisions [371 [jif . 
All searches combined here use the complete Tevatron 
data sample, which after data quality requirements cor- 
responds to 9.45 fb _1 - 9.7 fb _1 ; the size of the analyzed 
data set depends on the experiment and the search chan- 
nel. The online event selections (triggers) rely on fast 
reconstruction of combinations of high-p^ lepton candi- 
dates, jets, and ~$ T . Event selections are similar in the 
CDF and DO analyses, consisting typically of a prese- 
lection based on event topology and kinematics, and a 
subsequent selection using ^-tagging. Both collabora- 
tions use multivariate analysis (MVA) techniques that 
combine several discriminating variables into a single fi- 
nal discriminant which is used to separate signal from 
background. Each channel is divided into exclusive sub- 
channels according to various lepton, jet multiplicity, 
and 6-tagging characterization criteria aimed at group- 
ing events with similar signal-to-background ratio and so 
optimize the overall sensitivity. Due to the importance 
of 6-tagging, both collaborations have developed multi- 
variate approaches to maximize the performance of the 
6-tagging algorithms. A boosted decision tree algorithm 
is used in the DO analysis, which builds and improves 
upon the previous neural network &-tagger [39j], giving 
an identification efficiency of f» 80% for &-jets with a 
mis-identification rate of « 10%. The CDF ^-taggin g al - 
gorithm has been recently augmented with an MVA |40| , 
providing a ^-tagging efficiency of w 70% and a mis- 
identification rate of ~ 5%. 

The reconstructed dijet mass provides discrimination 
between signal and background. The decay width of the 
Higgs boson is expected to be much narrower than the ex- 
perimental dijet mass resolution, which is typically 15% 
of the mean reconstructed mass. A SM Higgs boson sig- 
nal would appear as a broad enhancement in the recon- 
structed dijet mass distribution. The sensitivity is en- 
hanced by combining the dijet mass with other kinematic 
information using multivariate discriminants. The MVA 
functions are optimized separately for each sub-channel 
and for each hypothesized value of ttih in the range 100- 
150 GcV/c 2 , in 5 GeV/c 2 intervals. The results from each 
sub-channel are summarized in histograms of the MVA 
discriminants for the expected Higgs boson signals, the 
backgrounds itemized by source, and the observed data. 

We interpret the results using both Bayesian and mod- 



ified frequentist techniques, separately at each value of 
win- These methods are described in Refs. [l5, 41, 42| . 
These techniques are built on a likelihood function which 
is a product of Poisson probabilities for observing the 
data in each bin of each sub-channel. Systematic un- 
certainties are parametrized with nuisance parameters, 
which affect the rates of the predicted signal and back- 
ground yields in each bin. A nuisance parameter may 
affect the predictions of multiple sources of signal and 
background in multiple sub-channels, thus taking corre- 
lations into account. A nuisance parameter may also af- 
fect multiple bins' predictions by different amounts, thus 
parameterizing uncertainty in the shapes of distributions. 
Gaussian priors arc assumed for the nuisance parameters, 
truncated to ensure that no prediction is negative. The 
signal predictions used correspond to SM Higgs boson 
production and decay, scaled by a factor R for all bins 
of all sub-channels. By scaling all signal contributions 
by the same factor, we assume that the relative contri- 
butions of the different processes are as predicted by the 
SM. 

In the Bayesian technique, we assume a uniform prior 
in R and integrate the likelihood function multiplied by 
the priors of the nuisance parameters to obtain the pos- 
terior density for R. The observed 95% credibility level 
upper limit on R, Rg^ s , is such that 95% of the integral 
of the posterior of R is below Rg$ s . The expected distri- 
bution of i?95 is computed in an ensemble of simulated 
experimental outcomes assuming no signal is present. In 
each simulated outcome, random values of the nuisance 
parameters are drawn from their priors. A combined 
measurement of the cross section for Higgs boson pro- 
duction times the branching fraction B(H —> bb), in units 
of the SM production rate, is given by i? fit , which is the 
value of R that maximizes the posterior density. The 
68% credibility interval, which corresponds to one stan- 
dard deviation (s.d.), is quoted as the smallest interval 
containing 68% of the integral of the posterior. 

We also perform calculations using the modified fre- 
quentist technique 42j, CL S , using a log- likelihood ratio 
(LLR) as the test statistic: 



LLR= -2 In 



p(data|ifi) 
p(data|-£/o) ' 



(1) 



where Hi denotes the test hypothesis, which admits the 
presence of SM backgrounds and a Higgs boson signal, 
H denotes the null hypothesis, for only SM backgrounds, 
and 'data' are either simulated data constructed from the 
expected signal and backgrounds, or the actual observed 
data. The probabilities p are computed using the best- 
fit posterior values of the nuisance parameters for each 
simulated experimental outcome, separately for each of 
the two hypotheses, and include the Poisson probabilities 
of observing the data multiplied by Gaussian constraint 
terms for the values of the nuisance parameters. The CL S 
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technique involves computing two p-values, 

CL b = p(LLR > LLR ohs \H a ), (2) 

where LLR Q \y S is the value of the test statistic computed 
for the data, and 



CL s+b = p{LLR>LLR ohs \H 1 ) 



(3) 



To compute limits, we use the ratio of p-values, CL S = 
CL s+ b/CLb. If CL S < 0.05 for a particular choice of Hi, 
parametrized by the signal scale factor R, that hypothesis 
is excluded at the 95% C.L. The median expected limit is 
computed using the median LLR value expected in the 
background-only hypothesis. 

The uncertainties on the signal production cross sec- 
tions are estimated from the factorization and renormal- 
ization scale variations, which include the impact of un- 
calculated higher-order corrections, as well as uncertain- 
ties due to PDFs, and the dependence on the strong cou- 
pling constant, a s . The resulting uncertainties on the 
inclusive WH and ZH production rates are 7% [H . We 
assign uncertainties to the prediction of B(H — > bb) as 
calculated in Ref. J43|. These uncertainties arise from 
imperfect knowledge of the mass of the b and c quarks, 
a s , and theoretical uncertainties in the bb and W + W~ 
decay rates. 

The largest sources of uncertainty on the dominant 
backgrounds are the rates of tagged U+heavy flavor jets, 
which are typically 20-30% of the predicted values. The 
posterior uncertainties on these rates are typically 8% or 
less. Uncertainties on lepton identification and trigger 
efficiencies range from 2% to 6% and are applied to both 
signal- and MC-based background predictions. These un- 
certainties are estimated from data-based methods sepa- 
rately by CDF and DO, and differ based on lepton flavor 
and identification category. The 6-tag efficiencies and 
mistag rates are similarly constrained by auxiliary data 
samples, such as inclusive jet data or ti events. The un- 
certainty on the per-jet 6-tag efficiency is approximately 
4%, and the mistag uncertainties vary between 7% and 
15%. The uncertainties on the measurements of the in- 
tegrated luminosities, which are used to normalize the 
expected signal yields and the MC-based backgrounds, 
are 6% (CDF) Q and 6.1% (DO) @. Of these values, 
4% arises from the inelastic pp cross section, which is 
taken to be correlated between CDF and DO. 

To validate our background modeling and search meth- 
ods, we perform a search for SM diboson production 
in the same final states used for the SM H — >• bb 
searches. The NLO SM cross section for VZ times 
the branching fraction of Z — > bb is 0.68 ± 0.05 pb, 
which is about six times larger than the 0.12 ± 0.01 pb 
cross section times branching fraction of VH(H — > bb) 
for a 125 GeV/c 2 SM Higgs boson. The data sam- 
ple, reconstruction, process modeling, uncertainties, and 
sub-channel divisions are identical to those of the SM 
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FIG. 1: Background-subtracted distribution of the recon- 
structed dijet mass mjj , summed over all input channels. The 
VZ signal and the background contributions are fit to the 
data, and the fitted background is subtracted. The fitted VZ 
and expected SM Higgs (mu = 125 GeV/c 2 ) contributions 
are shown with filled histograms. 
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FIG. 2: Background-subtracted distribution for the dis- 
criminant histograms, summed for bins with similar signal-to- 
background ratio (s/b), for the H — » bb (ran = 125 GeV/c 2 ) 
search. The solid histogram shows the uncertainty on the 
background after the fit to the data as discussed in the text. 
The signal model, scaled to the SM expectation, is shown 
with a filled histogram. Uncertainties on the data points cor- 
respond to the square root of the sum of the expected signal 
and background yields in each bin. 



Higgs boson search. However, discriminant functions are 
trained to distinguish the contributions of SM diboson 
production from those of other backgrounds, and po- 
tential contributions from Higgs boson production are 
not considered. The measured cross section for VZ is 
3.9 ± 0.6 (stat) ±0.7 (syst) pb, which is consistent with 
the SM prediction of 4.4 ± 0.3 pb. 

The combined background-subtracted reconstructed 
dijet mass (nijj) distribution for the VZ analysis is shown 
in Fig. [TJ The VZ signal and the background contribu- 
tions are fit to the data, and the fitted background is sub- 
tracted. Also shown is the contribution expected from a 
SM Higgs boson with m H = 125 GeV/c 2 . 

To visualize the results produced by the multivariate 
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VH analyses, we combine the histograms of the final 
discriminants, adding the contents of bins with similar 
signal-to-background ratio (s/b). Figure [5] shows the sig- 
nal expectation and the data with the background (in- 
cluding VZ) subtracted, as a function of the s/b of the 
collected bins, for the combined Higgs boson search, as- 
suming ran = 125 GeV/c 2 . The background model is 
fit to the data, and the uncertainties on the background 
are those after the nuisance parameters have been con- 
strained in the fit. An excess of events in the highest 
s/b bins relative to the background-only expectation is 
observed. We also show the LLR as a function of m# in 
Fig. El along with its expected values under the hypothe- 
ses Hq and Hi, and also the hypothesis that a SM Higgs 
boson is present with m# = 125 GeV/c 2 . 

We extract limits on SM Higgs boson production as a 
function of run in the range 100-150 GeV/c 2 in terms of 
i?95 S , the observed limit relative to the SM rate. These 
limits are shown in Fig. 21 together with the median ex- 
pected values and distributions in simulated experimen- 
tal outcomes assuming a signal is absent. We also show 
the median expected limits assuming the SM Higgs bo- 
son, with m,H = 125 GeV/c 2 , is present. We exclude 
ran < 106 GeV/c 2 at the 95% credibility level, while 
our median expected limit on m# is 116 GeV/c 2 , if no 
signal were present. The exclusions obtained with the 
CL S technique match those computed with the Bayesian 
technique. 

The observed limits are weaker than expected due to 
an excess events in the data with respect to the back- 
ground predictions in the most sensitive bins of the dis- 
criminant distributions, favoring the hypothesis that a 
signal is present. We characterize this excess by com- 
puting the best-fit rate parameter R Rt , which, when 
multiplied by the SM prediction for the associated pro- 
duction cross section times the decay branching ratio 
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FIG. 3: The log-likelihood ratio LLR as a function of Higgs 
boson mass. The dark and light-shaded bands correspond to 
the regions encompassing 1 s.d. and 2 s.d. fluctuations of the 
background, respectively. The dot-dashed line shows the me- 
dian expected LLR assuming the SM Higgs boson is present 
at m H = 125 GeV/c 2 . 
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FIG. 4: The observed 95% credibility level upper limits on 
SM Higgs boson production (R95) as a function of Higgs boson 
mass. The dashed line indicates the median expected value 
in the absence of a signal, and the shaded bands indicate the 
1 s.d. and 2 s.d. ranges in which R95 is expected to fluctuate. 
The dot-dashed line shows the median expected limit if the 
SM Higgs boson is present at mu = 125 GeV/c 2 . 
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FIG. 5: The best-fit cross section times branching ratio 
(o~wh + ozh) x B(H — > bb) as a function of tuh- The dark 
and light-shaded regions indicate the 1 s.d. and 2 s.d. mea- 
surement uncertainties, and the SM prediction is shown as 
the smooth, falling curve with a narrow band indicating the 
theoretical uncertainty. The expected cross section fit values 
assuming the SM Higgs boson is present at ran = 125 GeV/c 2 
are shown with dot-dashed lines for the cases of the expected 
SM rate (dark blue) and the best fitted rate from data (light 
magenta) . 



(o'WH + czh) x B(H — > bb), yields the best fit value for 
this quantity. We show our fitted ((Jwh + &zh) xB(H —> 
bb) as a function of run, along with the SM predic- 
tion, in Fig. [5] The figure also shows the expected cross 
section fits for each m# assuming that the SM Higgs 
boson, with mjj = 125 GeV/c 2 , is present. The ex- 
pected fits are shown for both the expected SM rate 
and the best fitted rate from data, which corresponds to 
(a W H + <Jzh) x B(H -> bb) = 0.23±g;g| (stat + syst) pb. 
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FIG. 6: The p-value as a function of mu under the 
background-only hypothesis. Also shown are the median ex- 
pected values assuming a SM signal is present, evaluated sep- 
arately at each run- The associated dark and light-shaded 
bands indicate the 1 s.d. and 2 s.d. fluctuations of possible 
experimental outcomes. 
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The corresponding SM prediction for tuh = 125 GeV/c 2 
is 0.12 ± 0.01 pb. 

The significance of the excess in the data over the 
background prediction is computed at each hypothesized 
Higgs boson mass in the range 100-150 GeV/c 2 by cal- 
culating the local p-value under the background-only hy- 
pothesis using i? fit as the test statistic. This p-value ex- 
presses the probability to obtain the value of i? fit ob- 
served in the data or larger, assuming a signal is truly 
absent. These p- values are shown in Fig. [5] along with 
the expected p-values assuming a SM signal is present, 
separately for each value of win- The observed p-value 
as a function of tuh exhibits a broad minimum and the 
maximum local significance corresponds to 3.3 standard 
deviations at tur = 135 GeV/c 2 . 

The Look-Elsewhere Effect (LEE) [H 



47 accounts 



for the possibility of a background fluctuation affecting 
the local p-value anywhere in the tested run range. In 
the mass range from 115 GeV/c 2 (the prior bound from 
the LEP2 direct search [H]) to 150 GeV/c 2 , the recon- 
structed mass resolution is typically 15%, and the result- 
ing LEE factor is approximately 2. Correcting for the 
LEE yields a global significance of 3.1 standard devia- 
tions. Taking into account the exclusion limits for the 
SM Higgs boson mentioned earlier, there is no LEE and 
we derive a significance of 2.8 standard deviations for 
m H = 125 GeV/c 2 . 

We interpret this result as evidence for the presence of 
a particle that is produced in association with a W or 
Z boson and decays to a bottom-antibottom quark pair. 
The excess seen in the data is most significant in the mass 
range between 120 and 135 GeV/c 2 , and is consistent 
with production of the SM Higgs boson within this mass 
range. Assuming a Higgs boson exists in this mass range, 
these results provide a direct probe of its coupling to b 
quarks. 
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